Nonlinear POMDPs for Active State Tracking with Sensing Costs
Abstract
Active state tracking is needed in object classification, target tracking, medical diagnosis, and estimation of sparse signals, among various other applications. Herein, active state tracking of a discrete-time, finite-state Markov chain is considered. Noisy Gaussian observations are dynamically collected by exerting appropriate control over their information content, while incurring a related sensing cost. The objective is to devise sensing strategies that optimize the trade-off between tracking performance and sensing cost. A recently proposed Kalman-like estimator [1] is employed for state tracking. The associated mean-squared error and a generic sensing cost metric are then used in a partially observable Markov decision process (POMDP) formulation, and the optimal sensing strategy is derived via a dynamic programming recursion. The resulting recursion proves to be nonlinear, which makes control policy design challenging. Properties of the related cost functions are derived, and sufficient conditions on the structure of the optimal control policy are provided, enabling characterization of when passive state tracking is optimal. To overcome the computational burden of the optimal sensing strategy, two lower-complexity strategies are proposed that exploit the aforementioned properties. The performance of the proposed strategies is illustrated in a wireless body sensing application, where cost savings as high as 60% are demonstrated for a 4% detection error with respect to a static equal allocation sensing strategy.
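The trade-off described in the abstract can be illustrated with a small sketch. It is not the paper's method: it uses the exact Bayes belief rather than the Kalman-like estimator of [1], and a one-step (myopic) rule rather than the dynamic programming recursion. All names (predict, update, belief_mse, expected_mse, myopic_control), the scalar observation model, and the weight lam are assumptions made for illustration. Each control u is assumed to select an observation noise variance variances[u] at sensing cost costs[u].

```python
import numpy as np

def predict(belief, P):
    """Propagate the belief one step through the chain (P is row-stochastic)."""
    return belief @ P

def update(pred, y, means, var):
    """Bayes update for a scalar observation y ~ N(means[i], var) in state i."""
    lik = np.exp(-0.5 * (y - means) ** 2 / var)
    post = pred * lik
    return post / post.sum()

def belief_mse(belief):
    """MSE of estimating the one-hot state indicator by the belief itself."""
    return 1.0 - float(np.sum(belief ** 2))

def expected_mse(pred, means, var, grid):
    """Expected posterior belief_mse, integrating over y with a Riemann sum."""
    dy = grid[1] - grid[0]
    # joint[i, k] = pred[i] * N(grid[k]; means[i], var)
    dens = np.exp(-0.5 * (grid[None, :] - means[:, None]) ** 2 / var)
    joint = pred[:, None] * dens / np.sqrt(2.0 * np.pi * var)
    p_y = joint.sum(axis=0)
    # p(y) * (1 - sum_i post_i(y)^2) = p(y) - sum_i joint_i(y)^2 / p(y)
    safe = np.maximum(p_y, 1e-300)
    return float(np.sum((p_y - (joint ** 2).sum(axis=0) / safe) * dy))

def myopic_control(belief, P, means, variances, costs, lam, grid=None):
    """Pick the control minimizing expected MSE + lam * sensing cost (one step)."""
    pred = predict(belief, P)
    if grid is None:
        sd = np.sqrt(max(variances))
        grid = np.linspace(means.min() - 8 * sd, means.max() + 8 * sd, 4001)
    totals = [expected_mse(pred, means, v, grid) + lam * c
              for v, c in zip(variances, costs)]
    return int(np.argmin(totals))
```

With lam set to 0 the rule selects the most informative (lowest-variance) control, and as lam grows it falls back to the cheapest one, which mirrors the regime in which passive tracking becomes preferable.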
Similar resources
POMDP Structural Results for Controlled Sensing
Structural results for POMDPs are important since solving POMDPs numerically is typically intractable. Solving a classical POMDP is known to be PSPACE-complete [40]. Moreover, in controlled sensing problems [16], [26], [10], it is often necessary to use POMDPs that are nonlinear in the belief state in order to model the uncertainty in the state estimate. (For example, the variance of the state...
Evaluating POMDP rewards for active perception
One popular approach to active perception is using POMDPs to maximize rewards received for sensing actions towards task accomplishment and/or continually refining the agent’s knowledge. Multiple types of reward functions have been proposed to achieve these goals: (1) state-based rewards which minimize sensing costs and maximize task rewards, (2) belief-based rewards which maximize belief state ...
ADAPTIVE FUZZY TRACKING CONTROL FOR A CLASS OF NONLINEAR SYSTEMS WITH UNKNOWN DISTRIBUTED TIME-VARYING DELAYS AND UNKNOWN CONTROL DIRECTIONS
In this paper, an adaptive fuzzy control scheme is proposed for a class of perturbed strict-feedback nonlinear systems with unknown discrete and distributed time-varying delays, and the proposed design method does not require a priori knowledge of the signs of the control gains. Based on the backstepping technique, the adaptive fuzzy controller is constructed. The main contributions of the paper...
Hilbert Space Embeddings of PSRs
Many problems in machine learning and artificial intelligence involve discrete-time partially observable nonlinear dynamical systems. If the observations are discrete, then Hidden Markov Models (HMMs) (Rabiner, 1989) or, in the control setting, Partially Observable Markov Decision Processes (POMDPs) (Sondik, 1971) can be used to represent belief as a discrete distribution over latent states. Pr...
Robust Tracking Control of Satellite Attitude Using New EKF for Large Rotational Maneuvers
Control of a class of uncertain nonlinear systems, which estimates unavailable state variables, is considered. A new approach for robust tracking control problem of satellite for large rotational maneuvers is presented in this paper. The features of this approach include a strong algorithm to estimate attitude, based on discrete extended Kalman filter combined with a continuous extended Kalman ...
Journal title:
- CoRR
Volume: abs/1405.5892
Pages: -
Publication date: 2014